Method and apparatus for detecting malicious code in an information handling system

ABSTRACT

A method for detecting malicious code on an information handling system includes executing malicious code detection code (MCDC) on the information handling system. The malicious code detection code includes detection routines. The detection routines are applied to executable code under investigation running on the information handling system during the execution of the MCDC. The detection routines associate weights to respective executable code under investigation in response to detections of a valid program or malicious code as a function of respective detection routines. Lastly, executable code under investigation is determined to be a valid program or malicious code as a function of the weights associated by the detection routines. Computer-readable media and an information handling system are also disclosed.

This application is a divisional of U.S. application Ser. No. 10/231,557, filed Aug. 30, 2002, now U.S. Pat. No. 7,748,039 which is incorporated by reference herein in its entirety.

BACKGROUND

The present disclosure relates generally to information handling systems, and more particularly to a method and apparatus for detection of malicious computer code in the form of a Trojan horse in an information handling system.

Trojan Horses (“Trojans”) are a particular type of malicious code. Malicious code is code that executes on an information handling system, typically a computer, but it can also be a Personal Digital Assistant or other information handling device, and is intended to damage the computer, alter the computer without the permission of the computer's user, or use the computer against the wishes of the computer's user. The Trojan horse is executable code that can exist in one of many forms. For example, a Trojan can be instantiated as one or more programs, as threads inside other programs, as plugins or shared modules loaded by other programs, or as modules loaded into operating system kernel memory in the manner of a device driver or loadable kernel module; this list is not exhaustive. A Trojan is a form of malicious code that enables a person to remotely control someone else's computer. The person who remotely controls the computer is known as the “Evil Hacker” while the person whose computer is being remotely controlled is known as the “Innocent Victim”. BackOrifice 2000, SubSeven, NetBus and OptixPro are all examples of Trojans. Trojans are sometimes referred to as “back-doors” or “hacker back-doors.”

Most Trojans have two components, the client program (Trojan Client) that is run on the Evil Hacker's computer and the server program (Trojan Server) that is run on the Innocent Victim's computer. Some Trojans have only a Trojan Server that can be remotely controlled through manually entered commands rather than through the programmatic interface of a Trojan Client.

Trojans can be used by Evil Hackers to disrupt the normal operation of the Innocent Victim's computer, to spy on the Innocent Victim, to steal money from the Innocent Victim, or to steal intellectual property from the Innocent Victim. The Evil Hacker often uses the Innocent Victim's computer to perform these malicious activities in order to harm the organization to which the Innocent Victim belongs. Trojans can thus harm computer systems, whether or not the particular computer systems belong to an individual, a company, organization, or government.

There are many ways to infect a computer with a Trojan including sending the Innocent Victim the Trojan Server disguised as a valid program, copying the Trojan Server onto the Innocent Victim's computer, or exploiting a vulnerability in the Innocent Victim's computer to place the Trojan Server on the computer.

Several techniques exist that are effective for detecting some forms of malicious code. For example, some types of malicious code can be detected by examining the binary code image of the running program or the binary image of the program when it is stored on a storage device. Many malicious code programs can be identified by a unique bit or byte pattern. The unique bit or byte pattern can comprise the entire image of the program while it is in memory or while it is stored on disk. The signature can also be a bit or byte pattern that is a portion of the program in memory or on disk. Once the unique sequence has been identified, a signature can be developed to identify the sequence. The signature is often the bit or byte pattern itself or it is in the form of a checksum. A detection program can then search for a malicious code program using the signature to identify the unique bit or byte sequence. Trojans, however, can be configured so that they have no easily identifiable signature. Trojans have configuration parameters that change the bit or byte sequences in the program and make it difficult or impossible to provide a unique signature. There are many tools available that can be used to reconfigure a Trojan so that it will not have a known signature.
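By way of illustration only, the two forms of signature described above (a byte pattern and a checksum) can be sketched as follows. This sketch is not part of the disclosed method; the function names, the sample image bytes, and the use of SHA-256 as the checksum are assumptions made for the example.

```python
import hashlib


def matches_pattern(image: bytes, pattern: bytes) -> bool:
    """Return True if the byte pattern occurs anywhere in the binary image."""
    return pattern in image


def matches_checksum(image: bytes, known_digest: str) -> bool:
    """Return True if the checksum of the whole image equals a known digest."""
    return hashlib.sha256(image).hexdigest() == known_digest


# Hypothetical binary image and signatures, for illustration only.
image = b"\x90\x90HEADER\xde\xad\xbe\xefPAYLOAD"
pattern = b"\xde\xad\xbe\xef"
digest = hashlib.sha256(image).hexdigest()

# Changing a single byte inside the matched pattern, as a reconfiguration
# tool might, defeats both forms of signature.
reconfigured = image.replace(b"\xde\xad\xbe\xef", b"\xde\xad\xbe\xee")
```

The `reconfigured` image differs from the original by a single byte yet no longer matches either signature, which is the weakness of signature-based detection noted above.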

Another technique used to identify malicious code examines the behavior of a Trojan Server while the Trojan Server is loaded and installed on a computer. With such a technique, a loaded and installed program is first placed into a Sandbox. The Sandbox comprises a restricted area on the computer where the program (e.g., Trojan Server) can be examined safely. While such an approach may be effective for preventing some Trojan infections, it does not detect Trojan Servers once they are already installed on a computer. Such an approach also fails to detect many Trojan Servers because Trojans do not exhibit their most characteristic behaviors while they are being loaded or installed; rather, they come alive and exhibit their malicious behavior after they have been loaded and installed.

Accordingly, it would be desirable to provide an improved method for detecting Trojans in a computer system and overcoming problems in the art discussed above.

SUMMARY

A method for detecting malicious code on an information handling system includes executing malicious code detection code on the information handling system. The malicious code detection code includes detection routines. The detection routines are applied to executable code under investigation running on the information handling system during the execution of the malicious code detection code. The detection routines assign weights to respective executable code under investigation in response to detections of a valid program or malicious code as a function of respective detection routines. Lastly, executable code under investigation is determined to be a valid program or malicious code based on scores that are determined as a function of the weights assigned by the detection routines. Computer-readable media and an information handling system are also disclosed.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a system block diagram of an information handling system for implementation of the method of detecting a Trojan according to one embodiment of the present disclosure; and

FIG. 2 illustrates a detection architecture of a Trojan detection program according to one embodiment of the present disclosure.

DETAILED DESCRIPTION

In FIG. 1, a system block diagram of an information handling system 10 for implementing the method of detecting a Trojan according to one embodiment of the present disclosure is shown. Information handling system 10 includes one or more of: a central processing unit (CPU) 12, memory 14, input/output (I/O) devices, such as a display, a keyboard, a mouse, and associated controllers, collectively designated by a reference numeral 16, a hard disk drive 18, or other storage devices or media drives, such as may include a floppy disk drive, a CD-ROM drive, a DVD drive, and the like, collectively designated by a reference numeral 20, or various other subsystems, such as a network interface card, wireless communication link, etc., collectively designated by a reference numeral 22, all interconnected, for example, via one or more buses, shown collectively as a bus 24. Examples of an information handling system may include a computer system, a personal digital assistant, a thin client device, a thick client device, or similar information handling device.

In one embodiment, information handling system (IHS) 10 is configured with a suitable operating system to install and run executable code, programs, etc., from one or more computer readable media 26, such as a floppy disk, CD-ROM, DVD, or the like. Information handling system 10 may further be configured for communicating with another information handling system 28, for example, through a network 30 via a suitable communication link or links. The operating system of IHS 10 may also be configured to install and run programs, download data, etc., via network 30. The illustrative embodiments of the present disclosure may be practiced over an Intranet, the Internet, virtual private network, or other suitable communication network.

According to one embodiment, the method of Trojan detection is implemented in the form of computer software, the computer software comprising instructions executable by the CPU of a computer system, for example, an Innocent Victim's computer system. The instructions include suitable program code processable by the computer system for performing the various functions as described herein. The various functions as discussed herein can be programmed using programming techniques well known in the art.

A novel method for detecting Trojans includes a method for detecting a server portion of a Trojan residing on a target computer system, for example, an innocent victim computer system. Furthermore, the method identifies the server portion of the Trojan when the server portion is executing on the target computer. As briefly discussed above, most Trojans have two components, a client program (Trojan Client) that is run on an Evil Hacker's computer and the server program (Trojan Server) that is run on the Innocent Victim's computer.

Accordingly, the method of detecting a Trojan includes a procedure for detecting a Trojan Server, i.e. the portion of a Trojan that resides on the Innocent Victim's computer system. The procedure can be embodied in a computer program, for example, a Trojan detection program. The Trojan detection program detects the presence of a Trojan while it is executing on a computer.

FIG. 2 illustrates an architecture of a Trojan detection program 40 according to an embodiment of the present disclosure. The Trojan detection program 40 includes detection routines 42 and a scoring algorithm 44. The detection routines 42 operatively couple to the operating system kernel 46 of the computer system under investigation via application programming interfaces (APIs) 48. The detection routines also access process behavior data 50 and binary image data 60, according to the particular requirements of a corresponding detection routine, further as discussed below.

In one embodiment, the Trojan detection program operates as follows. The Trojan detection program executes at any time, on an as-needed basis, a periodic basis, a random basis, another scheduled basis, or on an event driven basis in response to a particular event according to the particular requirements of a given situation. When the Trojan detection program executes, it examines the characteristics and behaviors of all computer programs that are executing at the current time on the subject computer system. The Trojan detection program evaluates each computer program that is running on the computer system under investigation, to determine whether the running computer program is a valid program or a Trojan.

The Trojan detection program 40 contains detection routines 42, including valid program detection routines 52 and Trojan detection routines 54. The valid program detection routines 52 include one or more routines identified by v₁, v₂, v₃, . . . , v_(M) in FIG. 2. The valid program detection routines 52 are configured to determine whether the program under investigation has characteristics and behaviors usually associated with a valid program. The Trojan detection routines 54 include one or more routines identified by t₁, t₂, t₃, . . . , t_(N) in FIG. 2. The Trojan detection routines 54 are configured to determine whether the program under investigation has characteristics and behaviors usually associated with a Trojan.

In one embodiment, the valid program detection routines 52 and the Trojan detection routines 54 are configured to gather information about each program under investigation by examining the program itself and by looking for information about the program in the operating system 46. The detection routines 42 access information from the operating system 46 using application programming interfaces (APIs) 48 to the operating system. The APIs 48 can include documented APIs, undocumented APIs, direct access to resources of the computer or information handling system such as memory or network connections, or kernel or device driver interfacing. The detection routines 42 gather information from the program itself by examining one or more of a binary image of the program that is stored in memory, a binary image of the program that is stored on disk or other media, the characteristics and behavior of the program, and any other related programs (such as libraries used by the program under investigation), represented by reference numerals 50 and 60 in FIG. 2.

For example, a detection routine 42 can be configured to take into account the following. Many Trojans log keystrokes on the Innocent Victim's computer and transmit the keystroke data from the Innocent Victim's computer to the Evil Hacker's computer. In one embodiment, a Trojan detection routine 54 determines whether or not the program being examined is logging keystrokes. Since there are many different ways for a program to log keystrokes, one or more of the Trojan detection routines 54 can be configured to examine the program under investigation to determine whether the program is using any of a number of different mechanisms for logging keystrokes.
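By way of illustration only, such a routine can be pictured as a check over previously gathered process behavior data. In the sketch below, the mechanism labels, the `observed` collection, and the function name are all hypothetical; they do not correspond to any real operating system API, and how the behavior data is gathered is outside the scope of the example.

```python
from typing import Iterable

# Hypothetical labels for keystroke-capture mechanisms; not real API names.
KEYLOG_MECHANISMS = frozenset({
    "keyboard_hook",           # a system-wide keyboard hook was installed
    "key_state_polling",       # key state is polled repeatedly
    "keyboard_filter_driver",  # a keyboard filter driver was loaded
})


def logs_keystrokes(observed: Iterable[str]) -> bool:
    """Trojan detection routine: fires if any known mechanism was observed."""
    return not KEYLOG_MECHANISMS.isdisjoint(observed)
```

Because the routine checks for any one of several mechanisms, a new mechanism can be covered by adding a label to the set without changing the routine itself.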

The Trojan detection program 40 further includes a scoring algorithm 44. The scoring algorithm calculates two scores: a valid program score 56 and a Trojan score 58. If the result of a valid program detection routine 52 indicates that the characteristic or behavior of the program being examined was that of a valid program, then a weight, W_(i), is associated with the routine and that weight contributes positively to the valid program score 56. A weight, W_(i), is assigned to each valid program detection routine, for i=1 to M, where M is the number of valid program detection routines.

If the result of a Trojan detection routine 54 indicates that the characteristic or behavior of the program being examined was that of a Trojan, then a weight, W_(j), is associated with the routine and that weight contributes positively to the Trojan score 58. A weight, W_(j), is assigned to each Trojan detection routine, for j=1 to N, where N is the number of Trojan detection routines.

According to one embodiment, the scoring algorithm 44 comprises an algorithm that includes an algebraic formula for determining the two scores 56 and 58. The scoring algorithm is dependent on the valid program detection routines 52 and the weights, W_(i), associated with each valid program detection routine, in addition to the Trojan detection routines 54 and the weights, W_(j), associated with each Trojan detection routine. The algebraic formula or equation can also be made arbitrarily complex, for example, to include associating weights to one or more combinations of detection routines 42.

In one embodiment, the scoring algorithm 44 includes an algebraic equation defined as a sum of weighted values. For example, the algebraic equation for the valid program detection routines can include an equation as given by:

$\mathrm{VALIDSCORE} = \sum_{i=1}^{M} W_{i},$

where W_(i) = weight of a valid detection routine v_(i), for i=1 to M.

Similarly, the algebraic equation for the Trojan detection routines can include an equation as given by:

$\mathrm{TROJANSCORE} = \sum_{j=1}^{N} W_{j},$

where W_(j) = weight of a Trojan detection routine t_(j), for j=1 to N.
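By way of illustration only, the two sums can be sketched as follows, reading each weight as contributing to its score only when the corresponding routine indicates a match, as described above. The representation of a program as a mapping of observed facts, the fact names, and the weight values are hypothetical and are chosen only to make the example concrete.

```python
from typing import Callable, Mapping, Sequence, Tuple

# A program is represented here by a mapping of hypothetical observed facts.
Program = Mapping[str, bool]
# Each detection routine is paired with its weight: (routine, weight).
WeightedRoutine = Tuple[Callable[[Program], bool], float]


def score(program: Program, routines: Sequence[WeightedRoutine]) -> float:
    """Sum the weights of the routines whose check fires for this program."""
    return sum(weight for routine, weight in routines if routine(program))


# Hypothetical valid program routines v_1, v_2 with weights W_1, W_2.
valid_routines = [
    (lambda p: p.get("has_visible_window", False), 2.0),
    (lambda p: p.get("is_signed", False), 3.0),
]
# Hypothetical Trojan routines t_1, t_2 with weights W_1, W_2.
trojan_routines = [
    (lambda p: p.get("logs_keystrokes", False), 4.0),
    (lambda p: p.get("listens_on_port", False), 1.5),
]

program = {"has_visible_window": True, "logs_keystrokes": True,
           "listens_on_port": True}
valid_score = score(program, valid_routines)    # 2.0
trojan_score = score(program, trojan_routines)  # 5.5
```

Keeping the routines and weights in two separate lists mirrors the separation between the valid program score 56 and the Trojan score 58.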

In another embodiment, more complex forms of the scoring algorithm 44 can be implemented in the form of more sophisticated algebraic formulae.
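By way of illustration only, one possible more complex form associates an additional weight with a combination of detection routines, so that the combination contributes only when every routine in it has fired. The sketch below is one assumed interpretation, not a formula given in this disclosure; the routine names `t1` to `t3` and all weight values are hypothetical.

```python
from typing import FrozenSet, Mapping


def combined_score(fired: FrozenSet[str],
                   single_weights: Mapping[str, float],
                   combo_weights: Mapping[FrozenSet[str], float]) -> float:
    """Sum per-routine weights plus a weight for each fully fired combination."""
    total = sum(w for name, w in single_weights.items() if name in fired)
    # A combination contributes only if all of its routines fired.
    total += sum(w for combo, w in combo_weights.items() if combo <= fired)
    return total


# Hypothetical weights for three Trojan detection routines.
single_weights = {"t1": 4.0, "t2": 1.5, "t3": 2.0}
# Hypothetical extra weight when t1 and t2 fire together.
combo_weights = {frozenset({"t1", "t2"}): 3.0}

both = combined_score(frozenset({"t1", "t2"}), single_weights, combo_weights)
only_t1 = combined_score(frozenset({"t1"}), single_weights, combo_weights)
```

Here `both` evaluates to 8.5 (4.0 + 1.5 + 3.0) while `only_t1` evaluates to 4.0, so the pair of behaviors counts for more than the two behaviors scored independently.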

If a program under investigation exceeds a valid program score threshold, V_(thres), then it is determined that the program is a valid program. If that program exceeds a Trojan score threshold, T_(thres), then it is determined that the program is a Trojan program. If a program is deemed to be valid using the valid algorithm, then it is sometimes removed from consideration from additional Trojan score routines.

Executable code and/or programs under investigation may also have some of the characteristics and behaviors of valid programs and some of the characteristics and behaviors of Trojans. If a program does not exceed either threshold or if a program does not have a significant difference between the valid program score 56 and the Trojan score 58, then according to another embodiment of the present disclosure, the method identifies the program in another category of Suspicious Programs or Anomalous Programs.
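By way of illustration only, the threshold comparison can be sketched as follows. The function name, the category labels, and the example threshold values are hypothetical. The sketch assumes that the valid program score is checked first, so that a program deemed valid is not considered further as a Trojan, and it does not model the "significant difference" criterion, for which no formula is given here.

```python
def classify(valid_score: float, trojan_score: float,
             v_thres: float, t_thres: float) -> str:
    """Categorize a program from its two scores and the two thresholds."""
    if valid_score > v_thres:
        return "valid"       # checked first; Trojan score is not consulted
    if trojan_score > t_thres:
        return "trojan"
    return "suspicious"      # neither threshold was exceeded


# Hypothetical thresholds V_thres = 5.0 and T_thres = 5.0.
print(classify(6.0, 9.0, 5.0, 5.0))
print(classify(1.0, 9.0, 5.0, 5.0))
print(classify(1.0, 1.0, 5.0, 5.0))
```

With these example values the three calls yield the valid, Trojan, and suspicious categories, respectively.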

In one embodiment, the method for detecting malicious code on a computer system includes executing a malicious code detection program on the computer system. The malicious code detection program includes detection routines. The malicious code detection program applies the detection routines to programs running on the computer system during the execution of the malicious code detection program. The detection routines are assigned weights that are factored by a scoring algorithm to determine a composite score based on the results of the detection routines and their associated weights. For example, a malicious code detection routine has a weight associated with it, such that if the malicious code detection routine determines that a given code under investigation is a Trojan, then the weight is applied positively towards the malicious code score for the code under investigation. Lastly, the malicious code detection program determines whether one or more programs of all programs running on the computer system during operation of the malicious code detection program is a valid program or malicious code as a function of the weights assigned to the detection routines.

In another embodiment, the method is configured to detect malicious code in the form of a Trojan horse on a computer having an operating system. The method includes executing a malicious code detection program on the computer. Detection routines of the malicious code detection program are configured to gather information about programs running on the computer during execution of the malicious code detection program. The detection routines include at least one selected from the group consisting of a) examining each executable code or program itself and b) searching for information about each executable code or program in the operating system. For example, examining code or a program can include examining a binary image of the same, wherever the binary image may reside, within the IHS or in computer readable media accessible to the IHS. In addition, the detection routines further consist of valid program detection routines and malicious code detection routines.

The malicious code detection program applies the detection routines to the programs running on the computer. In response to a detection of a valid program or malicious code, the detection routines assign weights to respective programs under test as a function of a respective detection routine. Lastly, the malicious code detection program determines whether a program is a valid program or malicious code as a function of the weights assigned by the detection routines. Determining whether the program is a valid program or malicious code involves the scoring of an execution of each detection routine as a function of a respective weight. A scoring algorithm is used to identify a program as malicious code in response to a valid score and a malicious code score, as discussed herein.

In yet another embodiment, the method for detecting malicious code on a computer system includes executing detection routines, the detection routines having been configured to examine at least one selected from the group consisting of characteristics and behaviors of programs running on the computer system. For example, the detection routines can be configured to access process behavior data of a program running on the computer system. In addition, the characteristics and behaviors may include one or more of logging keystrokes, saving a display screen view, uploading files, downloading files, running programs, and controlling a display screen of the computer system.

Subsequent to execution of one or more of the detection routines, weights are assigned as a function of the examined characteristics and behaviors, the assigned weights indicative of a valid program or malicious code as a function of respective detection routines. Lastly, the method determines whether a program is malicious code as a function of the weights assigned by the detection routines.

In the embodiment of the previous paragraph, the detection routines include valid program detection routines and malicious code detection routines. The valid program detection routines are configured to determine whether the program exhibits at least one or more characteristics and behaviors associated with a valid program. The malicious code detection routines are configured to determine whether the program exhibits at least one or more characteristics and behaviors associated with malicious code.

In one embodiment, the method of detecting Trojans is carried out in the form of a computer program. The computer program is executed on a desired computer system for detecting any potential Trojans present on the computer system. Execution of the computer program continues until all active programs on the computer system have been tested and evaluated. Alternatively, other criteria may be established for a duration of testing with the Trojan detection program. For example, execution of the malicious code detection program can be configured to occur in response to one or more of a random initiation and a periodic initiation.

According to another embodiment, the Trojan detection program comprises a small program configured for being delivered quickly, as well as for being executed quickly. The Trojan detection program can be delivered to the Innocent Victim's computer over a network, such as a Local Area Network (LAN), Wide Area Network (WAN), Internet, intranet, or any other global computer network 30. The Trojan detection program may also be delivered via suitable computer readable media, such as media 26 shown in FIG. 1.

While not stopping an infection of the computer system with Trojans, the method of the present embodiments identifies a Trojan when executing on a computer system. The method of identifying a Trojan could be combined with other methods, for example, a method for detecting infection, resulting in a more robust computer system malicious code protection implementation.

Although only a few exemplary embodiments have been described in detail above, those skilled in the art will readily appreciate that many modifications are possible in the exemplary embodiments without materially departing from the novel teachings and advantages of the embodiments of the present disclosure. Accordingly, all such modifications are intended to be included within the scope of the embodiments of the present disclosure as defined in the following claims. In the claims, means-plus-function clauses are intended to cover the structures described herein as performing the recited function and not only structural equivalents, but also equivalent structures. 

1. One or more non-transitory computer-readable media storing program instructions executable by an information handling system to: scan a plurality of programs currently running on the information handling system, wherein each of the plurality of programs is scanned while running on the information handling system in a manner that permits infection of the information handling system; wherein the scan includes, for each of the plurality of programs: executing a plurality of detection routines to compute a valid score and a malicious score for that program; and in response to the valid score for that program exceeding a valid score threshold value, categorizing that program as valid code, regardless of the malicious score for that program.
 2. The computer-readable media of claim 1, wherein at least one of the plurality of programs currently running on the information system is a thread.
 3. The computer-readable media of claim 1, wherein the program instructions are executable, upon completion of the execution of the plurality of detection routines for a given one of the plurality of programs, to categorize the given program into one of a set of categories indicative of the likelihood of that program compromising the security of the information handling system, wherein the set of categories includes a first category indicative of malicious code and a second category indicative of valid code.
 4. The computer-readable media of claim 3, wherein the plurality of detection routines include a first set of detection routines that determine whether the program being scanned has at least one of characteristics and behaviors typically associated with malicious code and wherein the plurality of detection routines include a second set of detection routines that determine whether the program being scanned has at least one of characteristics and behaviors typically associated with valid code.
 5. The computer-readable media of claim 1, wherein the scan includes, for each of the plurality of programs, in response to the malicious score for that program exceeding a malicious code threshold value and the valid score for that program being below the valid code threshold value, categorizing that program as malicious code.
 6. The computer-readable media of claim 1, wherein the plurality of detection routines include one or more detection routines that examine a binary image of the current program being scanned for one or more signatures.
 7. The computer-readable media of claim 1, wherein the plurality of detection routines include one or more detection routines that access an operating system of the information handling system on which the plurality of programs are running to determine information relating to the current program being scanned, wherein the access is made via one or more APIs of the operating system.
 8. The computer-readable media of claim 1, wherein the plurality of detection routines include one or more detection routines that access a memory of the information handling system to determine information relating to the current program being scanned.
 9. The computer-readable media of claim 1, wherein the plurality of detection routines include one or more detection routines that access a network connection of the information handling system to determine information relating to the current program being scanned.
 10. The computer-readable media of claim 1, wherein at least one of the plurality of programs is running in a kernel mode.
 11. The computer-readable media of claim 1, wherein the plurality of detection routines include one or more detection routines that interface with a device driver of the information handling system to determine information relating to the current program being scanned.
 12. The computer-readable media of claim 1, wherein the program instructions are executable by the information handling system to determine whether the current program being scanned is a Trojan horse.
 13. The computer-readable media of claim 1, wherein the plurality of detection routines include one or more detection routines to determine whether the current program being scanned is logging keystrokes of a user of the information handling system.
 14. The computer-readable media of claim 1, wherein the plurality of detection routines include one or more detection routines to determine whether the current program being scanned is uploading/downloading files from/to the information handling system.
 15. The computer-readable media of claim 1, wherein the program instructions are executable to perform a scan of currently running programs on an event-driven basis.
 16. A method, comprising: scanning a plurality of programs currently running on an information handling system, wherein each of the plurality of programs is scanned while running on the information handling system in a manner that permits infection of the information handling system, wherein the scanning includes, for each of the currently running programs, executing a plurality of detection routines to determine a valid score and a malicious score; and upon completion of execution of the plurality of detection routines for a first one of the plurality of programs, categorizing the first program as valid code based on the valid score for the first program exceeding a valid score threshold value, regardless of the malicious score for the first program.
 17. The method of claim 16, further comprising: upon completion of execution of the plurality of detection routines for a second one of the plurality of programs, categorizing the second program as malicious code based on the malicious score for the second program exceeding a malicious code threshold value and the valid score for the second program not exceeding the valid code threshold value.
 18. The method of claim 16, further comprising performing the scanning on an event-driven basis.
 19. The method of claim 16, wherein the plurality of detection routines include one or more detection routines that examine a binary image of the current program being scanned for one or more signatures, and include one or more detection routines that access an operating system of the information handling system on which the plurality of programs are running to determine information relating to the current program being scanned.
 20. An information handling system, comprising: a central processing unit (CPU); a memory storing program instructions executable by the CPU to: scan a plurality of programs currently running on the information handling system, wherein each of the plurality of programs is scanned while running on the information handling system in a manner that permits infection of the information handling system; wherein the scan includes, for each of the currently running programs, executing a plurality of detection routines to compute a valid score and a malicious score, and categorizing a program as valid code based on the valid score for that program exceeding a valid score threshold value, regardless of the malicious score for that program.
 21. The information handling system of claim 20, wherein the scan includes, for each of the currently running programs, categorizing that program as malicious code in response to the malicious score for that program exceeding a malicious code threshold value and the valid score for that program being below the valid code threshold value.
 22. The information handling system of claim 20, wherein the information handling system is configurable to perform a scan of currently running programs on an event-driven basis. 